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(57) Abstract: A method for estimating a receiver's location (X) in a wireless communication environment (RN) having several 
channels. Each channel has at least one signal parameter (V) that varies with location (X) differently from the other channels. A 
set of calibration data (CD) is determined for each calibration point, each set comprising the location (X) and at least one measured 
signal parameter (V) for each of several channels. The calibration data (CD) serve as a basis for a statistical model (SM) of the signal 
parameters (V) versus a receiver's location. A set of observed signal parameters (CO) is determined, the set comprising at least one 
signal parameter (V) for each of several channels at the receiver's location (X). A location estimate (LE) approximating the location 
(X) of the receiver (R) is determined on the basis of the statistical model (SM) and the set of observed signal parameters (CO). 
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Location estimation in wireless telecommunication networks 

Background of the invention 

The invention relates to methods and equipment for estimating a 
receiver's location in a wireless telecommunication environment, ie one or 

5 more networks which may be radio, microwave or optical networks. The one or 
more networks communicate at a plurality of channels simultaneously. Such a 
location estimation can be used to provide a wide variety of location- 
dependent services. 

US patent 6 112095 to Mati Wax et al. discloses a method for pro- 

10 viding a set of likely locations of a transmitter in a cellular network, such as 
AMPS or CDMA. A problem with the technique disclosed in the Wax patent is 
that it requires additional hardware at the network side, such as an antenna ar- 
ray which is equipped to measure an angular direction relative to a base sta- 
tion. In other words, to determine a mobile station's location, information on 

15 the network infrastructure must be available and the mobile station must 
transmit something to have its location estimated. 

Disclosure of the invention 

An object of the invention is to solve the above problems. In other 
words, the mechanism according to the invention should be able to estimate a 
20 receiver's location in a wireless telecommunication network even without prior 
knowledge of the network infrastructure (such as the locations of the base sta- 
tions). 

This object is achieved with a method and equipment which are 
characterized by what is disclosed in the attached independent claims. Pre- 
25 ferred embodiments of the invention are disclosed in the attached dependent 
claims. 

The invention is based on the surprising idea that it is possible to 
estimate a receiver's location with reasonable confidence without knowledge 
of the infrastructure of the receiver's wireless environment, ie the network(s) 

30 received by the receiver. For example, the technique disclosed in the above- 
referenced Wax patent relies on the cellular network's base station configura- 
tion, including the location of the base stations. It is indeed surprising that the 
technique according to the invention is feasible. The fact that it is surprising is 
apparent as soon as one walks around with a mobile phone having a field 

35 strength indicator. In some places, a shift of 20 to 30 cm changes the field 
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strength dramatically. Evidently, there must be a vast number of locations with 
near-identical field strength. One would expect that calibrating a location esti- 
mation system requires field strength (or other signal parameter) measure- 
ments at locations very close to each other, and that huge databases would be 
5 required to store these measurements. Atmospheric conditions, cityscapes 
and network configurations change continuously. At first sight, it would seem 
that the databases will deteriorate rapidly, unless constantly updated. How- 
ever, computer simulations show that a technique based on measurements at 
several channels (frequencies) is surprisingly robust. Also, calibration data can 

10 be collected automatically at various conditions. 

One aspect of the invention is a method for estimating a location of 
a receiver in a wireless telecommunication environment, the telecommunica- 
tion environment comprising a plurality of channels for simultaneous commu- 
nication, each channel having at least one signal parameter that varies with lo- 

15 cation differently from the other channels. The method can be implemented by 
the following steps: 

1) for each of a plurality of calibration points in the wireless tele- 
communication environment, determining a set of calibration data, each set of 
calibration data comprising the location of the respective calibration point and 

20 at least one measured signal parameter for each of several channels at that 
calibration point; 

2) on the basis of the sets of calibration data, maintaining a statisti- 
cal model of the signal parameters of the several channels versus a receiver's 
location in the wireless telecommunication network; 

25 3) measuring at least one signal parameter for each of several 

channels at the receiver; and 

4) estimating the location of the receiver on the basis of the statisti- 
cal model and the measured signal parameters of the several channels at the 
receiver. 

30 Another aspect of the invention is an arrangement for carrying out 

the above method. The arrangement can be embodied as a receiver compris- 
ing means for determining sets of observed signal parameters, each set com- 
prising at least one observed signal parameter for each of several channels at 
the location of the receiver. The receiver may itself comprise a location calcu- 

35 lation module for determining a location estimate approximating the location of 
the receiver on the basis of said sets and a statistical model of the signal pa- 
rameters of the several channels versus a receiver's location in a wireless 
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telecommunication environment Alternatively, the receiver may convey the 
sets to an external location calculation module. 

The term 'receiver 1 means that the device whose location is being 
estimated does not have to transmit when its location is being estimated. In 

5 other words, it suffices that the device is making observations of its wireless 
environment. For example, a GSM phone does not have to receive a traffic 
channel. Rather it makes observations at all available frequencies. The device 
may also have, and typically has, transmitting ability, but it is not necessary for 
all embodiments of the invention, and the invention can be used to estimate 

10 the location of a pager or a broadcast receiver. Because transmission capabil- 
ity is not essential to location estimation according to the invention, the re- 
ceiver may exploit signal parameters of networks it is not attached to. For ex- 
ample, a GSM phone attached to one GSM network may exploit signal 
strength values of other GSM networks. 

15 The term 'environment' means that the receiver can receive (make 

observations of) at least one network, but it can receive more than one. For 
example, a GSM phone may observe several operators' GSM networks. A 
more advanced receiver may observe many types of networks, such as cellu- 
lar networks and broadcast networks. 

20 A 'wireless 1 environment means that the one or more networks may 

be radio, microwave or optical networks. Also, the set of networks received by 
the receiver must communicate at a plurality of channels simultaneously, and 
the plurality of channels must comprise a subset of channels such that each 
channel in the subset has at least one signal parameter that varies with loca- 

25 tion differently from the other channels in the subset. This means that several 
channels having signal parameters with near-identical dependence from loca- 
tion, such as channels from a common transmitting antenna, do not normally 
give sufficient information for reliable location estimation. Normally, signals 
from at least three transmitting stations are required. Examples of suitable 

30 networks are cellular networks (such as GSM, GPRS, UMTS, etc.), broadcast 
networks (analogue audio, DAB or DVB), wireless local-area networks (WLAN) 
or short-range microwave networks, such as Bluetooth. 

A 'location' may have one to three dimensions. A one-dimensional 
presentation of location may be sufficient in trains and the like. Two- or three- 

35 dimensional presentations of location are much more useful, however. In a 
two-dimensional presentation, the receiver is assumed to be substantially at 
earth level. Actually the height does not matter as long as the calibration data 



WO 02/054813 



PCT/FI01/01151 



4 

is measured at the same height (such as ground level, 13th floor, etc.) as the 
actual observations. Additionally, the calibration data may comprise a presen- 
tation of time. This means that the wireless environment, ie its signal parame- 
ters, vary with time. In other words, the calibration data comprises, in addition 

5 to the signal parameters, one to three location coordinates and, optionally, a 
presentation of time. 

The term 'calibration data', as used herein, comprises calibration 
measurements (ie, measured signal values) and the location (and, optionally, 
time) at which the measurements were made. 

10 The term 'statistical model' means that the individual sets of calibra- 

tion data are not needed to calculate an individual receiver's location. The dif- 
ference between a statistical model and the sets of calibration data can be il- 
lustrated by the following example. Assume that we have a number of {x, y} 
pairs such that there is some dependence between x and y. The y value at a 

15 location x can be calculated on the basis of all the {x, y} pairs. A much faster 
way to predict the value of y given a value of x is to calculate a mathematical 
function y = f(x). In this example, the function f is the statistical model. In other 
words, the value of y given a value of x is calculated without reference to the 
individual {x, y} pairs. Location estimation on the basis of the statistical model 

.20 is faster and requires less storage space than location estimation on the basis 
of the individual sets of calibration data. 

The statistical model can have a large variety of different implemen- 
tations, such as probabilistic models, neural networks,, fuzzy-logic systems, 
kernel estimators, support vector machines, decision trees, regression trees, 

25 Kalman filters and other statistical filtering methods, wavelets, splines, induc- 
tive logic programming methods, finite mixture models, hidden Markov models, 
etc. As used in this context, the term 'statistical model' may also refer to a mix- 
ture of several statistical (sub)models. 

The term 'channel' should have a wide interpretation, meaning 

30 more or less the same as a frequency or frequency band. The receiver does 
not have to communicate on the channel, as long as the receiver (or an at- 
tached measuring apparatus) can measure at least one signal parameter of 
that channel. In TDMA systems, each frequency has several timeslots, each of 
which carries one channel. As far as the invention is concerned, all timeslots 

35 having the same frequency give identical information, and any one of them 
can be used as a 'channel'. If the measured signal parameter is signal 
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strength, the receiver does not even have to be able to interpret the contents 
of the channel. 

An illustrative but non-exhaustive list of the signal parameters vary- 
ing with location comprises signal strength, timing advance and error ratio. The 
5 list may also comprise the availability of certain channels, but this can be seen 
as a special case in which the signal strength and/or error ratio is quantified to 
a yes/no question. If directional antennas are used, the direction of the radio 
beam(s) can be used as well. Thus the measured signal parameters do not 
have to correspond to a certain channel, but they can be derived values. For 

10 example, a measured parameter set may be or comprise a vector V = [V1 , V2, 
V3...] in which V1, V2 etc. are the indices of the best, second best, etc. avail- 
able channel. For the purposes of clarity, however, we will use examples in 
which the signal parameters are related to certain channels. 

Each set of calibration data comprises the location of the respective 

15 calibration point and at least one measured signal parameter for each of sev- 
eral channels at that calibration point. Calibration points are points whose 
location and signal parameters are known or measured. The calibration meas- 
urements are typically determined by means of fixed and/or mobile calibration 
receivers. Fixed calibration receivers can be attached to buildings, traffic signs, 

20 lampposts and the like. Mobile calibration receivers can be transported with 
persons or in vehicles. The calibration receivers measure the signal parame- 
ters like the actual receivers do. The measured signal parameters can be 
transferred to the statistical model via wired or wireless transmission (=on-line) 
or by moving a detachable media, such as a memory disk, tape or card (=off- 

25 line). 

Location estimation can take place at the receiver site or at the 
network site. If the location is estimated at the receiver site, the receiver (or an 
attached computer) must have access to the statistical model. With current 
technology, a feasible statistical model can be compressed to a size which is 

30 manageable in a laptop or palmtop computer. The model can be updated 
while the computer is connected to the Internet, for example. Alternatively, the 
model can be supplied on a detachable memory, such as a CD-ROM or DVD- 
ROM. In the future, even a mobile phone will have sufficient memory for hold- 
ing the statistical model. The model can be updated by means of a data call 

35 via a fast connection, for example. If the receiver site stores a copy of the sta- 
tistical model, it needs no transmission capability, and the actual receiver can 
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be a broadcast receiver, a pager or a dedicated add-on card for a laptop com- 
puter, similar in appearance to current GSM attachment cards for laptops. 

Alternatively, the receiver may be part of a transceiver, such as a 
mobile phone or a WLAN or Bluetooth interface attached to a portable or 

5 handheld computer. In this case, the transceiver may send the measurement 
results to the network which forwards the results to a location server. Depend- 
ing on the type of transceiver, the measurements can be sent in a short mes- 
sage, via a data call or a WAP or WLAN connection, for example. The location 
server can send the transceiver its location estimate over a similar connection. 

10 According to a preferred embodiment of the invention, the signal 

parameter measurements (the calibration measurements and/or the receiver's 
current observations) are quantified to a relatively small number of classes, 
such as two to five classes. In other words, the granularity of the measure- 
ments is increased. At first sight, such granularity increase seems to lose in- 

15 formation. For instance, assume that the signal strength of a certain channel at 
a certain location is 34 units on a scale of 0 to 100 (the actual unit is irrele- 
vant). Instead of storing the result of 34 units, we only store the fact that the 
measurement was between 25 and 50, ie a value of 1 on a scale of 0 to 3. It 
would seem that a value of 34 on a scale of 0 - 100 can better predict the sig- 

20 nal strength in the neighbourhood of that location than a value of 1 on a scale 
of 0 to 3 does. However, in many cases increased granularity results in in- 
creased location accuracy. One reason for this is that on a high-resolution 
scale, there are many values that occur relatively seldom, whereas on a low- 
resolution scale, all possible values occur relatively frequently. 

25 An advantage of the invention is that prior information on the net- 

work infrastructure is not necessary (although it may be useful). This means 
that a location service according to the invention is not tied to network opera- 
tors. Even if the location service according to the invention is maintained by a 
network operator, that operator can exploit observations from other operators* 

30 networks without prior information on their infrastructure. The invention is ap- 
plicable in a wide variety of network techniques, such as cellular networks, 
broadcast networks or wireless local-area networks. 

Brief description of the drawings 

The invention will be described in more detail by means of preferred 
35 embodiments with reference to the appended drawing wherein: 
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Figure 1 illustrates various graphs of signal parameter versus re- 
ceiver location; 

Figure 2 is a block diagram illustrating the general concept of the 

invention; 

5 Figure 3 is a block diagram illustrating a typical calibration receiver 

for determining calibration measurements; 

Figures 4A and 4B are block diagrams illustrating mobile receivers 
whose location is to be estimated; and 

Figure 5 illustrates the structure of a statistical model. 

10 Detailed description of the invention 

Figure 1 illustrates various graphs of signal parameter versus re- 
ceiver location. The horizontal axis represents the (one-dimensional) location 
of a receiver. The vertical axis represents a signal parameter V (such as signal 
strength, or error ratio) measured by a receiver. Graphs A and B depict signal 

15 parameters of two channels. In this hypothetical example, we have 10 data 
points Di to D10 measured at location X<i to X 10> respectively. Both graphs A 
and B share the data points Di to Di 0 having the respective locations Xi to X i0 
and the signal parameter value V 0 . Figure 1 gives a faint idea of the difficulties 
in implementing the invention. Not only is the parameter value V 0 common to 

20 10 different locations (in this example), but the 10 locations could be explained 
equally well by both graphs A and B. The well-known Nyquist criterion states 
that a signal can be fully reconstructed if sampled at more than twice its high- 
est frequency component. If the graphs A and B represent, say, field strength 
in a GSM network having a nominal frequency of 900 MHz, the spatial fre- 

25 quency of the graphs A and B has a wavelength of approximately 30 cm. Ac- 
cordingly, the signal parameters should be sampled at points less than 15 cm 
apart, which is clearly an impossible task. But if the signal parameters are 
sampled at points more than half a wavelength apart, the graphs A and B can- 
not be reconstructed, as evidenced by the fact that between points X6 and X10 

30 the graphs A and B have no similarity whatsoever. 

The reason that the present invention works in practice stems from 
the fact that as the number of channels increases, the number of locations 
where the channels behave as described above decreases rapidly, and so it 
becomes increasingly unlikely that any two points cannot be distinguished 

35 from each other based on the measured parameters. 
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Figure 2 is a block diagram illustrating the general concept of the 
invention. In Figure 2, the invention is implemented as a compact location es- 
timation module LEM, although more distributed implementations are equally 
possible. An essential feature of the invention is a statistical model SM of the 
5 receiver's wireless environment, the model being able to predict the receiver's 
location given a plurality of current observations at the receiver site. The statis- 
tical model SM is built and maintained by a model construction module MCM, 
on the basis of calibration data CD and, optionally, on the basis of prior infor- 
mation PI of the wireless environment. The optional prior information PI may 

10 comprise information on network infrastructure, such as the locations and ra- 
dio parameters of base stations. The locations at which calibration measure- 
ments are collected are called calibration points. The calibration data CD 
comprises data records each of which comprises the location X of the calibra- 
tion point in question and the set of signal parameters V measured at that cali- 

15 bration point. Optionally, the calibration data records may also comprise the 
time at which the measurement was made, in case the signal parameters vary 
with time. The location X can be expressed in any absolute or relative coordi- 
nate system. In special cases, such as trains, highways, tunnels, waterways or 
the like, a single coordinate may be sufficient, but normally two or three co- 

20 ordinates will be used. The reference sign X denotes the set of all coordinates 
of the location. 

It should be noted that the term 'training data 1 is often used in the 
context of such statistical models. In the context of this invention, the term 
'calibration' is preferred, because 'training' may convey the idea that the model 
25 is ready after initial training, whereas 'calibration' better conveys the idea that 
the model may have to be updated regularly as the conditions change. 

There is also a location calculation module LCM for producing a lo- 
cation estimate LE on the basis of the receiver's current observations CO and 
the statistical model SM. Technically, the 'measurements' and 'observations' 
30 can be performed similarly, but to avoid confusion, the term 'measurement' is 
generally used for the calibration measurements, and the signal parameters 
obtained at the current location of the receiver are called 'observations'. The 
receiver's most recent set of observations is called current observations. The 
location calculation module LCM or a separate estimate interpretation module 
EIM may also use the receiver's observation history OH to interpret the loca- 
tion estimate. In other words, the observation history OH can be used to re- 
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solve ambiguities in cases where a set of observations can be explained by 
two or more locations with substantially equal probability. 

Figure 3 is a block diagram illustrating a typical calibration receiver 
CR for determining the calibration measurements in the calibration data CD 
5 shown in Figure 2. Figure 3 shows a mobile calibration receiver comprising a 
portable computer (or data processor) PC-C, a mobile station MS-C (such as a 
GSM, GPRS or UMTS mobile phone) and a location receiver, such as a GPS 
(global positioning system) device. The suffixes -C stand for calibration re- 
ceiver, to distinguish the corresponding parts of the actual receiver R in Figure 

10 4. For clarity, the calibration receiver's main modules PC-C, MS-C and LR are 
shown separately, although the two latter modules are available as PC cards 
which can be inserted into a card socket in a typical laptop computer. The 
calibration receiver CR observes the radio signal parameters of the available 
base stations BS in a cellular radio network RN. The interface between the ra- 

15 dio network RN and the mobile station MS-C is called a radio interface Rl. If 
the radio interface Rl is bidirectional, the calibration receiver CR may send its 
observations to the location estimation module LEM via the same radio inter- 
face Rl. Alternatively, the calibration receiver's portable computer PC-C may 
store the observations on a detachable memory DM medium, such as a re- 

20 cordable CD-ROM disk, which is later brought off-line to the location estima- 
tion module LEM. 

The location receiver LR of the calibration receiver CR can be en- 
tirely conventional, for example a commercial GPS (global positioning system) 
receiver, as long as it can output the measured coordinates to an attached 

25 computer or other data processor. The portable computer can also be a con- 
ventional, suitably programmed computer. Only the mobile station MS-C may 
need modifications to its hardware or firmware (its ROM contents). Modifica- 
tions may be needed, depending on how many signal parameters the mobile 
station measures. For example, a conventional GSM phone monitors, in addi- 

30 tion to its currently active cell, some parameters of its neighbouring cells, but 
the neighbouring cells are not measured as extensively as the active cell. Only 
when a GSM phone is having an active call, does it monitor the neighbouring 
cells as extensively as its active cell. For the purposes of the invention, it 
would be beneficial to modify the mobile station's cell monitoring routines such 

35 that it monitors the available cells as extensively as possible. 
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Naturally, the calibration receiver CR can comprise more than one 
mobile station for monitoring different types of networks or different operator's 
network. For monitoring broadcast networks, the calibration receiver CR 
should also comprise a scanning broadcast receiver (not shown separately). 
5 Alternatively, the mobile station MS can be a multi-mode device capable of re- 
ceiving cellular networks and broadcast networks. 

Calibration receivers, like the one shown in Figure 3, can be earned 
along in vehicles or with persons. Fixed calibration receivers, which do not 
need a GPS receiver, can be attached to buildings, traffic signs, lampposts 

10 and the like. As an alternative to using a separate location receiver, the loca- 
tion of the calibration receiver can be determined by one or more of the follow- 
ing techniques: showing the receiver's location on a digitized map; entering a 
street (or other) address and converting it to a location by means of a suitable 
database; or using other known locations, such as stops of public vehicles. 

15 Figure 4A is a block diagram illustrating a typical mobile receiver 

whose location is to be estimated. A simple embodiment of a receiver R com- 
prises only a suitably programmed mobile station MS. For some embodiments, 
the receiver R may also comprise a portable computer (or data processor) PC. 
Again, the term 'receiver' implies that the device is receiving when its location 

20 is being estimated although, in practice, most embodiments will also have 
transmitting capability. The embodiment shown in Figure 4A does not contain 
the statistical model SM. Accordingly, the receiver R must send its current ob- 
servation set CO to the location estimation module LEM via the base station 
BS it is connected to. The location estimation module LEM returns the receiver 

25 its location estimate LE via the radio interface Rl. 

Figure 4B shows an alternative embodiment in which the receiver's 
attached computer PC receives a copy of the statistical model SM on a de- 
tachable memory DM, such as a CD-ROM disk, and the receiver is able to de- 
termine its own location without transmitting anything. As a yet further alterna- 

30 tive (not shown separately), the receiver's attached computer PC may receive 
the statistical model via an Internet (or any other data) connection to the loca- 
tion, estimation module LEM. Future wideband mobile stations may be able to 
receive the statistical model via the radio interface RL A hybrid of the tech- 
nologies may also be used such that the receiver receives an initial statistical 

35 model via a wired connection or on the detachable memory, but later updates 
to the model are sent via the radio interface. 
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Note that in Figures 3, 4A and 4B f the radio network RN is shown 
as a cellular network and the mobile stations MS resemble cellular handsets. 
The invention is not limited to cellular networks, however, and can equally well 
be used in a WLAN environment, in which case the mobile stations are re- 
5 placed by WLAN interface devices. 

Statistical modelling 

Possible statistical models will now be studied in more detail. In 
general, a statistical model, as used in this context, can comprise several indi- 
vidual statistical submodels, in which case the actual estimate is obtained by 

10 combining the individual results of the submodels. 

There are many possible statistical modelling approaches that can 
be used for producing the required statistical submodels. In the following we 
will focus on the probabilistic approach. A probabilistic model means that when 
estimating the location of the mobile terminal, the result is represented as a 

15 probability distribution over the possible locations if location X is modelled as a 
discrete variable, whereas, if the location X is modelled as a continuous vari- 
able, the result is represented as a density function. In the following, the focus 
will be on the discrete case. Similarly, the location-dependent measurements 
V can also be modelled either with discrete or continuous observational vari- 

20 ables. The number of dimensions of vector V (the number of measurements 
that can be obtained) varies and depends on the properties of the operating 
wireless network(s). 

Again, there are many probabilistic model classes that can be used. 
In the following preferred embodiment of the invention, the focus will be on pa- 

25 rametric probabilistic models. In this case a single model can be represented 
as a pair (M,0), where M denotes the model structure, ie the qualitative proper- 
ties of the model that determine which parameters are required, and 0 denotes 
the quantitative values of the parameters. 

There are two principal approaches for constructing parametric 

30 probabilistic models (M, 0) in the present context, namely conditional models 
and joint models. Conditional models are models that directly give probability 
distributions of the form P(X | V,M, 0), where V denotes the values of the ob- 
servational variables (for example, a vector consisting of signal strength 
measurements), and X denotes the location where observation V was made. 

35 Joint models define probability distributions P(X,V | M, 0) on events (X,V). 
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However, by using the axioms of probability theory we can see that 
P(X | V t M, 9) = P(X ,V | M, 0)/P(V | M, 9), where P(V | M, G) does not depend 
on the location X. Thus we can treat the denominator P(V | M, 0) as a normal- 
izing constant. This means that we can always use a joint model for condi- 
5 tional modelling, and in the following we will focus on joint modelling and re- 
gard conditional modelling as a special case. 

There are many ways to use parametric models in location estima- 
tion. Let us first assume that we have decided to use a single model structure 
M, and we wish to determine the parameters from the calibration data CD so 
10 that we get a joint probabilistic model for events (X,V), which also gives, as 
described above, the required conditional distribution for location X given the 
observations V. As described in [Kontkanen et al. 2000], there are several al- 
ternatives for producing the joint distribution: 

1. We can use P(X, V | M, 9(D)), where 0(D) is the maximum likelihood instan- 
15 tiation of the parameters, ie 9(D) = arg max P(D | M, 9). 

2. We can use P(X,V | M, 9(D)), where 0(D) is the Bayesian maximum poste- 
rior instantiation of the parameters, ie 0(D) = arg max P(0 | M, D) 

3. We can use P(X,V | M, 9(D)), where 9(D) is the mean of the posterior distri- 
bution P(9 | M, D). 

20 4. We can integrate over the parameters 9: P(X, V | D,M) = JP(X,V | D,M, 9)P(9 
| D,M)d9. 

5. We can use P(X, V | M, 9(D)), where 9(D) is the parameter instantiation op- 
timizing the minimum message length criterion described in [Wallace and 
Dowe 1999] and the references therein. 
25 In some special cases, alternatives 3 and 4 are equivalent. 

In general, one may wish to use several model structures M. In the 
following, we will assume that we have fixed the general model family (set) F, 
the set of all the possible model structures under consideration. For example, 
the set F may correspond to the set of all possible Bayesian network models 
30 (see [Cowell et al. 1999], [Pearl 1988]). In this case we produce the predictive 
distribution P(X |V,F) by computing a weighted sum over all the models in F: 
P(X |V,F) oc 2 P(X , V | M) W(M). Possible weighting functions W include the 
following: 

1. The posterior of the model structure M, given the data: 
35 P(M | D) oc P(D | M)P(M) = P(M) J P(D |9,M)P(9 | M)d9. 
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2. The stochastic complexity of the data, given the model structure M, and the 
approximations of the stochastic complexity criterion, discussed in 
[Rissanen 1999] and the references therein. 

3. The minimum message length of the data, given the model structure M, and 
5 the approximations of the MML criterion, discussed in [Wallace and Dowe 

1999] and the references therein. 

It is also possible to use conditional (supervised) versions of the 
weighting functions, in which case the weights are computed with respect to 
conditional modelling, and the actual data is taken to consist of only the values 

10 of the location variable X, and the measurement data V is treated as "back- 
ground data". These alternatives are discussed in [Kontkanen et al 1999]. 

If the number of model structures M in F is too high for computing 
the weighted sum in a feasible time, we have to restrict the model family F by 
performing a search in F, and pruning F to consist of only those model struc- 

15 tures that are the best with respect to some cost function. The possible cost 
functions for performing the search include the weight functions listed above. 
Any search algorithm can be exploited in this task. An extreme case of this 
type of restricting search is a case where only one single model structure M in 
F is chosen. In other words, the sum over model structures reduces to a single 

20 term corresponding to the use of a single model with the largest weight. 

If the observations V are modelled as discrete variables, the granu- 
larity of the discrete variables can be viewed as part of the model structure M. 
The granularity can either be fixed by the user (representing prior information), 
or as part of the model structure M, it can be learned from the calibration data. 

25 The optional prior information, such as information on the locations 

and radio parameters of the base stations, represents knowledge other than 
that extracted from the calibration measurements. In the probabilistic setting, 
we can identify the following ways for coding the prior information: 

1 . By choosing the initial model family F of probability models (determining the 
30 model structures considered, and with each model structure, the forms of 

the distributions used and the assumptions made). 

2. If the observational variables V are taken to be discrete, by choosing the 
granularity of the discretization. 

3. If the location variable X is taken to be discrete, by choosing the granularity 
35 of the discretization. 
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4. By determining the prior distribution P(8 | M) for the parameters of the 
model M. 

5. By determining the prior distribution P(M) for the model structures M in the 
family F. 

5 Missing data 

There are several alternative procedures for handling missing data: 

1 . Treat 'missing' as an extra value for the variable in question. 

2. Ignore the missing entries (the sufficient statistics are computed from the 
existing data only) 

10 3. Estimate the missing values from the existing data and/or prior information. 
The estimates can either be used for filling in educated guesses of the 
missing values, or they can be treated as partial observations (sufficient sta- 
tistics of several possible values can be simultaneously partially updated, 
according to, e.g., their estimated probabilities). 

15 4. Fill in the missing values by using random guesses. 

Location interpretation and reporting 

The result of the probabilistic location estimation can be reported in 
several different ways. First, we can divide the working area into several sub- 
areas in different ways: the subareas can either form a full partitioning of the 
20 work area, or they can cover only a portion of the whole work area. An exam- 
ple of the latter case is that only the locations listed in the calibration data D 
(with a desired accuracy) are considered. The result of the probabilistic loca- 
tion estimation can now be reported either 

1. By giving the full probability distribution over the areas, ie, for each area X, 
25 give the corresponding probability P(X | V,F). 

2. By giving the most probable subarea X with respect to the distribution P(X | 
V,F). 

3. By giving a point estimate minimizing the expected value of some error 
function with respect to the distribution P(X | V,F). 

30 An example of alternative 3 is the mean squared error, in which 

case the point estimate is the weighted average of the centre points of the su- 
bareas (assuming that the subareas are of equal size), the weights being the 
probabilities P(X | V,F). If the subareas X are not of equal size, the weights 
can be rescaled with respect to the relative size of the corresponding subarea, 

35 for example by multiplication. 
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Uncertainty about the receiver's location can be reduced by prior in- 
formation PI, if available, and/or the observation history OH. Let us assume 
that the above alternative 1 was chosen initially. In other words, the user or 
application requesting the location of the receiver is reported the full probability 
5 distribution. The probability distribution may indicate a number of feasible loca- 
tions. The prior information PI, if available, may indicate that only one of the 
locations is possible, given the received cell identifiers or the like. Alternatively, 
the observation history OH can be used to exclude some locations. For exam- 
ple, although a number of locations could explain the receiver's current loca- 
10 tion, only a subset of the locations can explain the entire observation history 
OH, given the receiver's finite speed. 

Performance examples 

Example 1: Location estimation with the Naive Bayes model. 

The subareas X under consideration are the locations where the 

15 calibration data was collected. The radius of the locations is assumed to be 
one meter, although any unit can be used. The observational variables V are 
taken to be discrete with m values. The value of m can be a constant (eg 3), or 
it can be optimized by using one of the weighting functions described above. 
The boundary points between the intervals can be determined so that the 

20 number of training samples within each interval is the same (equal-frequency 
discretization), or alternatively, the intervals can be made of equal width 
(equal-width discretization). The intervals can also be determined by using a 
clustering algorithm, such as the K-means algorithm. 

One model structure M is used: the observational variables Vi,...,V n 

25 are assumed to be independent, given the value of the location variable X. 
The data is assumed to be independently and identically distributed (= "i.i.d."), 
and follow the Multinomial distribution with the assumptions described in [Gei- 
ger and Heckerman, 1998]. Prior information is non-existent. A non- 
informative uniform prior distribution for the model parameters is used. 

30 Alternatives are discussed in [Kontkanen et al, 2000]. The distribution P(X,V | 
D,M) is computed by integrating over the parameters. With the above 
assumptions, this can be done as described in [Kontkanen et al., 2000]. 

In this experiment, the observation history OH is taken into account 
by treating the eight (other numbers are equally possible) last signal meas- 

35 urements as a single measurement vector V so that the eight individual meas- 
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urements are assumed to be independent of each other. The result is given as 
a point computed as a weighted average of the centre points of the subareas, 
where the weight for area X is P(X | V,D,M). 

This method was implemented and tested empirically in downtown 
5 Helsinki by exploiting the signal strengths of Sonera GSM network. The work 
area was approximately 400 x 500 meters in size, and the calibration data was 
collected outside in the streets in approximately 50 evenly distributed points. 
The average distance between two measurement locations was approximately 
50 meters. The system was tested by using the location estimator in 300 ran- 
10 domly situated locations within the work area. The average location error in 
this test was 42 meters. 

Example 2: Location estimation with a Mixture of Histograms model. 

The location variable X is taken to consist of two coordinates (one 
or three coordinates are also possible) on a fine-grained, discrete scale. The 

15 resolution of the scale can be, for instance, one meter. The observational vari- 
ables Vi, .... V n are taken to be discrete with the maximum resolution deter- 
mined by the measuring device, e.g. 1 dBm. We denote the combination of Vi, 
V n by V. Missing values are replaced by a value which is smaller than any 
possible observable value. Several models are considered. Each model M w is 

20 associated with parameters k, I, and 9 W whose semantics is described below. 

Figure 5 illustrates the structure of model M k i. The value of variable 
Xk is obtained from the value of variable X via discretization into k values. The 
conditional distribution of variables Vi(l), V n (l) given the value of Xr is de- 
scribed by model parameters 0 k i . Each Vj, where i belongs to the set {1 , n}, 

25 is uniformly distributed within the interval defined by the value of variable Vj(l). 
A low-resolution location variable Xk is derived from the fine-grained location 
variable X by discretizing the latter with equal width discretization (other discre- 
tization methods also possible) using k bins, ie k possible values. Whenever a 
boundary point of the low-resolution discretization appears between two 

30 boundary points of the fine-grained discretization, the mass (i.e. the number of 
observations within the sub-interval) is divided according to the relative size of 
the overlapping intervals. For instance, let the fine-grained discretization have 
5 bins (4 boundary points) within the range [0, 10]. Let the low-resolution dis- 
cretization have 2 bins, and therefore one boundary point at the value 5. If 

35 there are n observations within the range [4, 6], i.e. within the third bin of the 
fine-grained discretization, then both low-resolution bins get n/2 observations, 
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because the boundary point 5 splits the range [4, 6] into two parts of equal 
size. Likewise, each observational variable V| is discretized using I possible 
values, thus obtaining a low-resolution variable Vj(l). 

Model Mw describes the conditional probability functions P(V(I) | Xk, 
5 M k i, 9 k i), where 9 k i denotes the model parameters of model Mw. The low- 
resolution observational variables V^l), V„(l) are taken to be independent, 
given the value of the location variable Xk. For each i belonging to the set {1, 
.... n}, the distribution P(V|(I) | Xk, M w , e w ) is taken to be i.i.d., and follow the 
Multinomial-Dirichlet distribution with the assumptions described in [Geiger 

10 and Heckerman, 1998]. Prior information is nonexistent. A uniform prior distri- 
bution over models Mm is used. A non-informative equivalent sample size 
(ESS) prior distribution for the model parameters is used as described in 
[Heckerman, 1995]. A second-order prior for the ESS parameter is used, e.g., 
a uniform distribution over the set {1 , 10}. For each i belonging to the set {1 

15 n}, the distribution P(V|(I) | Xk, M w ) is computed by integrating over the model 
parameters. With the above assumptions, this can be done as described in 
[Kontkanen et al., 2000]. 

The distribution P(V| | Vj(l)) is taken to be uniform over the interval 
defined by the value of Vj(l) and the discretization of V| defined by parameter I. 

20 For instance, let the range of Vj be [0, 10], let the value of I be 5, and let the 
value of Vj(l) be 2. Assuming that equal width discretization is used, the values 
of V, are discretized into five intervals, [0, 2], [2, 4], [4, 6], [6, 8], and [8, 10]. 
Now, given that the value of V|(I) is 2, the distribution P(Vj | Vj(l)) is uniform 
over the interval [2, 4]. The variables Vi V„ are taken to be independent of 

25 each other given the values of variables Vi(l) V n (l). 

Combining the two distributions P(V(I) | Xk, M M ) and P(V | V(l)), we 
obtain a conditional distribution P(V | Xk, M w ). The distribution P(V | X, D) is 
computed as a weighted average over the models Mw, where k and I vary over 
the set {1, .... 20} (other choices equally possible). The models are weighted 

30 by the marginal likelihood P(V(D) | X k (D), M M ), where the calibration data (with 
n observations) consists of the vectors V(D) = (V 1 (D), .... V n (D)) and Xk(D) = 

(Xk 1 (D) X k n (D)). 

With these assumptions, the marginal likelihood can be computed 
efficiently in two parts: First, the product of the terms with the form P(V(I) | X k , 

35 Mm) can be computed as described in [Heckerman, 1995] and [Geiger and 
Heckerman, 1998]. Second, the terms with the form P(V | V(l)) have the same 
value, which is a constant depending on I, because the distribution P(V | V(l)) 
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is uniform. The result is given as a posterior probability distribution P(X | V, D) 
= P(V | X, D) P(X | D) / P(V | D) over the location variable X. The distribution 
P(X | D) is taken to be uniform. The term P(V | D) is a normalizing factor 
whose value is ignored. Instead the resulting distribution P(X | V, D) is normal- 

5 ized so that it sums up to one. 

The method described above was implemented and tested empiri- 
cally in Helsinki on the second floor of the building at address Teollisuuskatu 
23 by using a laptop computer measuring the WLAN signal strengths through 
a WLAN PC card. The work area was approximately 20 x 45 meters (900 

10 square meters) in size. Calibration data was collected in 12 arbitrary places, 
and the total number of data vectors in it was 204. The system was tested by 
using the location estimator in 25 randomly chosen locations within the work 
area. Location estimation was repeated five times in each location. When the 
above system was used for determining the location area with 95% of the 

15 probability mass, the correct place was in this area 77 % of the time. The av- 
erage size of the 95 % probability mass area was approximately 151 square 
meters, i.e. about 17 % of the total area. 

References: 

Cowell, R., Dawid PA, Lauritzen S., Spiegelhalter D: Probabilistic 
20 Networks and Expert Systems, Springer, New York, 1999. 

Geiger, D. and Heckerman, D: Parameter Priors for Directed Asycfic 
Graphical Models and Characterization of Several Probability Distributions, 
Techical Report MSR-TR-98-67, Microsoft Research, December 1998. 

Heckerman D., A Tutorial on Learning with Bayesian Networks, 
25 Technical Report MSR-TR-95-06, Microsoft Research, 1995. 

Kontkanen, P., Myllymaki, P., Silander, T., Tirri, H., and GrQnwald, 
P: On Predictive Distributions and Bayesian Networks, Statistics and 
Computing 10 (2000), p. 39 - 54. 

Kontkanen, P., Myllymaki, P., Silander, T. and Tirri, H: On 
30 Supervised Selection of Bayesian Networks, Proceedings of the 15th 
International Conference on Uncertainty in Artificial Intelligence (UAI'99), 
Laskey, K. and Prade, H., 1999, Morgan Kauffmann, p. 334 - 342. 

Pearl, J: Probabilistic Reasoning in Intelligent Systems: Networks of 
Plausible Inference, Morgan Kaufmann Publishers, San Mateo, CA, 1988. 
35 Rissanen, J: Hypothesis Selection and Testing by the MDL Princi- 

ple, Computer Journal 42 (1999) 4, p. 260-269. 



WO 02/054813 



PCT/FI01/01151 



19 

Wallace, C.S. and Dowe, D.L., Minimum Message Length and Kol- 
mogorov Complexity, Computer Journal 42 (1999) 4, p. 270-283. 



All references are incorporated herein by reference. 



WO 02/054813 



PCT/FI01/01151 



20 

Claims 

1 . A method for estimating a location (X) of a receiver (R, R') in a 
wireless telecommunication environment (RN), the telecommunication envi- 
ronment comprising a plurality of channels for simultaneous communication, 

5 each channel having at least one signal parameter (V) that varies with location 
(X) differently from the other channels; 

characterized in that the method comprises the steps of: 
for each of a plurality of calibration points in the wireless telecom- 
munication environment, determining a set of calibration data (CD), each set 
10 of calibration data comprising the location (X) of the respective calibration 
point and at least one measured signal parameter (V) for each of several 
channels at that calibration point; 

on the basis of the sets of calibration data (CD), maintaining a sta- 
tistical model (SM) of the signal parameters (V) of the several channels versus 
15 a receiver's location in the wireless telecommunication environment (RN); 

determining a set of observed signal parameters (CO), the set com- 
prising at least one observed signal parameter (V) for each of several chan- 
nels at the location (X) of the receiver (R, R'); and 

determining a location estimate (LE) approximating the location (X) 
20 of the receiver (R, R') on the basis of the statistical model (SM) and the set of 
observed signal parameters (CO). 

2. A method according to claim 1, characterized by the re- 
ceiver (R) sending the set of observed signal parameters (CO) to an external 
location estimation module (LEM) which sends the location estimate (LE) to 

25 the receiver. 

3. A method according to claim 1, characterized by the re- 
ceiver (R*) storing a copy of the statistical model (SM) and determining the lo- 
cation estimate (LE) on the basis of the copy of the statistical model (SM). 

4. A method according to any one of the preceding claims, cha- 
30 racterized by maintaining the statistical model (SM) also on the basis of 

prior information (PI) about the wireless environment's (RN) infrastructure. 

5. A method according to any one of the claims 1 to 4, characte- 
rized in that the statistical model (SM) is or comprises a probabilistic model, 
preferably a Bayesian model. 



WO 02/054813 



PCT/FI01/01151 



21 

6. A method according to claim 5, characterized in that the sta- 
tistical model (SM) is or comprises a Bayesian network model. 

7. A method according to any one of the preceding claims, cha- 
racterized in that the signal parameters (V) in the statistical model (SM) 

5 are independent of each other, given the location (X). 

8. A method according to any one of the preceding claims, cha- 
racterized by reducing uncertainty concerning the receiver's location on the 
basis of a history (OH) of the observed signal parameters. 

9. A method according to any of the preceding claims, characte- 
10 rized by modelling at least some of the signal parameters (V) by discrete 

variables whose values correspond to intervals or unions of intervals on the 
range of possible signal parameter values. 

10. A method according to any of the preceding claims, charac- 
terized by modelling the location (X) as a discrete variable. 

15 11. A location estimating apparatus (LEM) for estimating a location 

(X) of a receiver (R, R') in a wireless telecommunication environment (RN), the 
telecommunication environment comprising a plurality of channels for simulta- 
neous communication, each channel having at least one signal parameter (V) 
that varies with location (X) differently from the other channels; 

20 characterized by: 

a model construction module (MCM) for 

- receiving a set of calibration data (CD) for each of a plurality of 
calibration points in the wireless telecommunication environment, each set of 
calibration data comprising the location (X) of the respective calibration point 

25 and at least one measured signal parameter (V) for each of several channels 
at that calibration point; and 

- maintaining, on the basis of the sets of calibration data (CD), a 
statistical model (SM) of the signal parameters (V) of the several channels 
versus a receiver's location in the wireless telecommunication environment 

30 (RN); 

and a location calculation module (LCM) for: 

- receiving a set of observed signal parameters (CO), the set com- 
prising at least one observed signal parameter (V) for each of several chan- 
nels at the location (X) of the receiver (R, R'); and 



WO 02/054813 



PCT/FI01/01151 



22 

- determining a location estimate (LE) approximating the location 
(X) of the receiver (R, R') on the basis of the statistical model (SM) and the set 
of observed signal parameters (CO). 

12. A receiver (R, R') comprising means for determining sets of ob- 
5 served signal parameters (CO), each set comprising at least one observed 

signal parameter (V) for each of several channels at the location (X) of the re- 
ceiver (R), characterized by means for conveying the sets of observed 
signal parameters (CO) to a location calculation module (LCM) for determining 
a location estimate (LE) approximating the location (X) of the receiver (R) on 
10 the basis of said sets and a statistical model (SM) of the signal parameters (V) 
of the several channels versus a receiver's location in a wireless telecommuni- 
cation environment (RN). 

13. A receiver (R') according to claim 12, characterized by 
comprising the location calculation module (LCM). 

15 14. A receiver (R) according to claim 12, characterized in that 

the means for conveying the sets of observed signal parameters comprises 
means (Rl) for conveying the sets to an external location calculation module 
(LCM). 

15. A receiver according to any one of claim 12 to 14, characte- 
20 rized in that at least some of the sets of observed signal parameters (CO) 
relate to networks the receiver is not attached to. 
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